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chiral symmetries, are known to suffer from sign problems when formulated using the auxiliary field approach. 
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I. INTRODUCTION 

Four-fermion field theories are interesting in both con- 
densed matter and particle physics. The well known Hub- 
bard model and its variants are often used in studying cuprate 
superconductors [ 1 ], antiferromagnets J2) and more recently 
graphene O. Low energy nuclear physics is also studied with 
four-fermion couplings in the effective field theory framework 
(4] 13. In the context of more fundamental theories like QCD, 
four-fermion field theories offer a simpler setting to study phe- 
nomena like fermion mass generation and chiral symmetry 
breaking [6 1. It has been suggested recently that quantum crit- 
ical phenomena in graphene can be studied with four-fermion 
field theories ||7][8). Despite the wide interest, strongly cou- 
pled four-fermion field theories remain poorly understood as 
compared to their bosonic counterparts due to computational 
difficulties. 

The only available method to compute quantities in a 
strongly interacting field theory with no small parameter is 
the Monte Carlo (MC) method. Due to the quantum nature 
of a fermion there are no natural fermion configurations with 
positive weights that can be used for important sampling. In 
two space-time dimensions fermions can often be bosonized 
and models can we written in terms of world line configura- 
tions with positive weights. This fact can be used to design 
powerful MC methods [9-11 1. In higher dimensions, the tra- 
ditional MC approach is to integrate the fermions out in fa- 
vor of a determinant of a large fermion matrix. Whenever 
this determinant is positive a non-local probability distribu- 
tion emerges, which can be used to construct a MC method. 
The most popular is the Hybrid Monte Carlo (HMC) method 
lfL2l [13 1 which has continued to evolve in many ways since 
its discovery fl4l . Unfortunately, small eigenvalues of the 
fermion matrix which naturally arise in the presence of mass- 
less fermions can cause singularities in the HMC approach. 
This makes it difficult to study quantum critical phenomena 
containing massless fermions. While other determinantal MC 
methods do not encounter such problems, they scale poorly 
with system size ifTSIl . In cases where the determinant of the 
fermion matrix is not positive, the original theory is said to 
suffer from a sign problem and the traditional approach is not 
useful. The repulsive Hubbard model away from half filling 



is a classic example where progress has been limited due to 
sign problems. Other relativistic four-fermion field theories 
like the Gross-Neveu (GN) models and Nambu-Jona-Lasinio 
(NJL) models are also known to suffer from sign problems in 
three or more space -time dimensions fl6l . 

Recently a new approach called the fermion bag approach 
was proposed to solve some four-fermion field theories ifTTl - 
[T9l . It is an extension of the meron cluster idea proposed some 
time ago ll20l . The idea behind the fermion bag is to identify 
fermion degrees of freedom that cause sign problems and col- 
lect them in a bag and sum only over them. This is in con- 
trast to traditional approaches where all fermion degrees of 
freedom in the entire thermodynamic volume are summed to 
solve the sign problem. When the fermion bag contains only 
a small fraction of all the degrees of freedom and the summa- 
tion can be performed quickly, the fermion bag approach can 
be used to design powerful MC methods. Sometimes, the bag 
splits into many disconnected pieces further simplifying the 
calculation. The fermion bag approach has three main advan- 
tages: (a) Due to a duality, fermion bag sizes are small both at 
weak and strong couplings, (b) Singularities in the massless 
limit can be tackled without a problem, (c) Some sign prob- 
lems that haunt traditional approaches are naturally solved. 
While the first two advantages have been demonstrated, the 
third advantage is not so clear from previous work. Here we 
show how solutions to some unsolved sign problems in four- 
fermion models also emerge naturally in the fermion bag ap- 
proach. 

It is useful to clarify some confusions that may arise about 
what we mean by a sign problem and thus a solution to the 
sign problem. If one can write the partition of a quantum sta- 
tistical mechanics system as a sum over configurations whose 
Boltzmann weights are all positive and if the cost of computa- 
tion of the Boltzmann weights only scales as a polynomial in 
system size, then we say the model does not suffer from a sign 
problem. However, as already stated above, in fermionic sys- 
tems there are no natural configurations where the Boltzmann 
weights are positive. The conventional method is to use the 
auxiliary field approach to expand the partition function as a 
sum of bosonic configurations where the fermion determinant 
is taken as part of the Boltzmann weight. If this weight can 
be negative one often says the model suffers from a sign prob- 
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lem. However, if an alternate approach can be found where 
the sign problem disappears, one can of course say the prob- 
lem never suffered from a sign problem to begin with. On the 
other hand, if this alternate approach was not known earlier, 
the new approach can be considered as a solution to the sign 
problem present in the other method. This is what we mean 
when we say "solutions to unsolved sign problems". It must 
be noted that all sign problems are problems in exactly this 
sense. Once a solution is found there is no longer a problem. 
It is of course likely that some problems may remain unsolved 

EU. 

We consider lattice GN models containing N flavors of 
massless staggered fermions with either a Z 2 or a U (1) chiral 
symmetry fl6l . While we work in three space-time dimen- 
sions, our results can easily be extended to higher dimensions. 
Although in three dimensions the symmetries we refer to are a 
part of a flavor symmetry, they are often loosely called chiral 
symmetries in the literature. The Z 2 models with odd N and 
all the U(l) models are known to suffer from a sign problem 
when formulated in the traditional auxiliary field approach. 
Here we show that the sign problems disappear in the fermion 
bag approach. Our paper is organized as follows. In section 
2 we review the auxiliary field approach to lattice GN models 
with both Z 2 and U(l) chiral symmetries and discuss how the 
sign problems arise. In section 3 we discuss the severity of 
the sign problems. In section 4 we discuss the fermion bag 
approach and show that sign problems do not arise. Section 5 
contains our conclusions. 



Following [ 16 1, we define the auxiliary fields on dual sites 
x. The model with a Z2 chiral symmetry is defined through a 
single real auxiliary field a(x), such that 



(4a) 
(4b) 



{x,x) 



while the model with a (7(1) chiral symmetry requires two 
real auxiliary fields a(x) and ir(x), such that 

S AF [a,n] = ^]T(a 2 (i) + 7r 2 (£)), (5a) 

" X 

(x,x) 

where e(x) is the parity of a lattice site (1 on even sites and 
— 1 on odd sites). In the above expressions, the set of nearest 
dual sites x surrounding the fixed lattice site x is denoted as 
(x, x) (see Fig. TJ. In this work we only consider these two 
classes of mode s. 




H. AUXILIARY FIELD APPROACH 

Lattice GN models are formulated in the auxiliary field ap- 
proach through the action 

Xt{y) + s AF (i) 

x,y,i 

where Xi{ x )iXi( x ) denote the Grassmann valued fermion 
fields of flavor i — 1,2.., N at the lattice site x. The ex- 
plicit form of the auxiliary field action Saf depends on the 
GN model and will be discussed below. The matrix D[ip\ is 
defined by 

(D[$]) xy = D xy + 6 xy 0(x), (2) 

where <p(x) is a function of the auxiliary fields as defined be- 
low and D XiV is the free staggered fermion matrix Il22l424l . 

D x ,y m&x,y ~t~ ^ ^ ^ [^:c+a,y &x,y+a] • (3) 

a=l,2,3 

Since we work in three dimensions, a labels the three di- 
rections, rk.a = e^'^Ci = (0,0,0), C2 = (1,0,0), 
£3 = (1,1,0) are the staggered fermion phase factors and m 
is the bare fermion mass. We assume anti-periodic boundary 
conditions in all directions and denote the lattice volume by 
V = L 3 . 



FIG. 1. Nearest neighbor lattice sites (open circles) of a fixed dual 
site (filled circle) x is represented by [x, x] (see left figure), while the 
nearest neighbor dual sites of a fixed lattice site is denoted by {St, x) 
(see right figure). 

It is easy to verify that Sqn is invariant under U(N) 
flavor transformations. When m = 0, additional chi- 
ral symmetries emerge. The Z 2 model is invariant under 
Xi(x) e(x)xi(x), Xi(x) -t -Xi[x)e(x), <r{x) -> 
—cr(x) while the U(l) model is invariant under the additional 
U(l) chiral symmetry Xi( x ) ~^ e lE ^ e / 2 Xi(x), Xi(x) — > 
X i (x)e le( ' x ' ,e / 2 , a(x) -> a(x) cos 6 + ir(x) sin 8, tt(x) -> 
7r(x) cos 9—a(x) sin 9. The models contain a quantum critical 
point (QCP) separating a chirally symmetric phase (at small 
couplings) from a phase where the chiral symmetry is spon- 
taneously broken (at large couplings). The symmetries that 
govern the QCP needs proper analysis due to fermion dou- 
bling. Without such an analysis it is difficult to establish the 
continuum field theory that emerges at the critical point [19]. 

In the traditional MC approach, one integrates over the 
Grassmann fields and writes the partition function of the GN 
models as 

f 1 N 

Z Z2 - jpa] e -^FM I D etD($) I , (6a) 

Zu(i) = I [DaVn] e ~ SA ^ \ DetD([$\) \ , (6b) 
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In order to design a MC method the determinant terms in the 
above expressions have to be real and positive. In the Z 2 
model since cj> is real, the matrix elements of D[(j>] are real. 
Hence, the determinant is real but not necessarily positive. In 
the case of the U(l) model, <fi is complex and so the matrix 
elements of D[<f>] and its determinant can be complex. Hence, 
the Z 2 model as formulated in Eq. ( (6a| > suffers from a sign 
problem for all odd values of N, while the U(l) model as for- 



mulated through Eq. ( 6b 1 suffers from a sign problem for all 
values of N. 



An important question to study is whether the HMC algo- 
rithm is getting trapped in the sector of configurations with 
positive weights (or negative weights). Note that, in the Z 2 
models the only way to move from a positive weight sector 
to the negative weight sector is to pass through configurations 
which have almost zero weight assuming the step size in the 
HMC algorithm is small. Perhaps the suppression of the tun- 
neling between the two sectors leads to long auto-correlation 
times or even lack of ergodicity. This argument also applies 
to Z2 models with even N ||29| . 



III. SEVERITY OF THE SIGN PROBLEM 



IV. FERMION BAG APPROACH 



Earlier calculations in the Z 2 and U(l) GN models have 
all been performed in the auxiliary field approach Il25lj27l . 
The essential focus has been to understand the quantum phase 
transition and compute the critical exponents. These calcula- 
tions have circumvented the sign problem by studying even 
N in the Z 2 case or by introducing conjugate fermions with 
an opposite chiral charge in the U(l). Inclusion of conjugate 



fermions changes the partition function from Eq. ( 6b 1 to 



J u(i) 



[Da] [Dir] e 



-Sa[ 



DetD(M) 



27V 



(7) 



and changes the flavor symmetries to U(N) x U(N) while the 
chiral symmetry remains unchanged. 

The N = 1 model with Z 2 chiral symmetry was also 
studied in the auxiliary field approach using the HMC al- 
gorithm ll28l . Strangely, in this study the sign of the de- 
terminant was never discussed and seems to have been ig- 
nored. Since the results of the quantum critical behavior were 
in quantitatively agreement with large N results (improved 
with Pade-approximations), it may have been assumed that 
the sign problem was mild. If this is indeed true then sta- 
tistically, positive sign configurations should dominate over 
negative sign configurations. The Z 2 model studied in 11281 
is slightly different from the model studied here. The auxil- 
iary fields cr(x) also live on the main lattice site and the field 
4> appearing in the Dirac operator D[(j>] of Eq. 0, is defined 
as 4>(x) — g J2( z x) a ( z )< where now { z i x ) refers to the six 
nearest neighbor sites z for a given site x. In order to study 
the sign problem, we generated several Gaussian random aux- 
iliary field configurations according to the distribution 



P(a(x)) = cxp ^ - £ [a 2 {x) - \ log^)} j 



(8) 



and computed Det (£)[</>]) for each of these configurations. We 
then separated the configurations into those with a positive de- 
terminant and those with a negative determinant. In Fig. |2]we 
plot the distribution of configurations with positive and nega- 
tive determinants as a function of log |Det(D[</>])| for 6 3 and 
12 3 lattices. As can be seen, the distribution of configura- 
tions with positive and negative weights are almost identical 
suggesting a severe sign problem rather than a mild one ! Al- 
though we are not performing important sampling, our results 
clearly show that the sign problem must be studied carefully. 



We will now show that the sign problems in both the Z 2 
and the U(l) models discussed in section [IT] disappear in the 
fermion bag approach. The proof relies on the fact that any Ap- 
point correlation function involving the i th flavor of staggered 
fermions defined through 

Xi{xii)Xi(xii) - Xi{xi h .)xi{xi h .) (9) 

is positive semi-definite. This is due to the special properties 
of the free staggered fermion matrix. Indeed, using the ideas 
developed in the fermion bag approach lfl9l . we can write 

C i (af il ,..,:0=Det(D) Deb(G[{x}i]) = Dek(W[{x}i]) 

(10) 

where G[{x}i] is the ki x ki matrix of propagators between 



the ki sites in the set {x}i 



p = 1, .., fcj whose ma- 



trix elements are G XptXq — D Xp 1 Xq and the matrix VF[{a;}i] 
is a (V — ki) x (V — ki) matrix identical to the matrix D 
except that the sites in the set {x}i are dropped from the ma- 



trix. All the determinants appearing in Eq.( 10 1 can be shown 



to be positive (or zero). The simplest way to see this is to 
consider the matrix W. Since it is exactly the same as the 
staggered fermion matrix with some sites removed, its eigen- 
values come in complex conjugate pairs of the form m ± i\. 
Unpaired eigenvalues are always m and they too come in pairs 
when the lattice is bipartite. When m — then the determi- 



nant can be exactly zero. Thus, Ct (x^ , 



) > 0. We will 



use this property to prove the absence of a sign problem in the 
fermion bag approach. 

Instead of integrating out the fermion fields let us integrate 
out the auxiliary fields first and construct the appropriate four 
fermion action for the models. Let us first consider the Z 2 
model. Each integral over the auxiliary field <r(x) on the dual 
site x gives, 

J e = J da{i) e - s ^- £ ¥ 1 (Si,[ 3! ,»iXiWx*W) = tf e -S I (x)^ 

(11) 

where Af = v /2ng 2 /N and 



Si(x) 
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|- , 2 

[ Xi{x)Xi(x)\ , 



(12) 
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FIG. 2. Distributions of positive (left graphs) and negative (right graphs) weight configurations as a function of log | Det (D [4>] ) | . One million 
configurations and 5000 configurations were generated at 6 3 and 12 3 lattices respectively. The distribution of positive configurations is almost 
identical to the one with negative configurations suggesting a severe sign problem. 



is the effective four-fermion interaction term at each dual 
site x. The symbol [x,x] denotes the set of all lattice sites 
surrounding the dual site x (see Fig. [TJ . Thus, each in- 
tegral generates many four-fermion couplings of the form 
Xi( x )Xii x )Xj(y)Xj{y) where i and j are arbitrary flavor in- 
dices and x and y are corners of the cube surrounding the dual 
site x. We can classify the possible couplings into four types 
based on the bonds (xy) connecting the corners x and y. If 
the two corners are the same we refer to it as a site-bond or a 
5-bond. If the two corners are the two neighboring sites we 
get a L-bond (or a link-bond). Similarly, if the two corners are 
across a face diagonal or a body diagonal, we call the bonds 
F-bond and S-bond respectively. These four bond types are 
illustrated Fig. [3] 




FIG. 3. An illustration of the four types of four-fermion couplings 
(or bonds) generated through the auxiliary field integration. From 
left to right we have a S, L, F and B bond respectively. 

Integration over all the auxiliary field variables yields 
the four-fermion interaction term of the action Sz 2 , int = 
Si(x). Collecting the terms in each of the four types of 
four fermion couplings separately we see that 

Sz 2 ,int = U S B S + U L B L + U F B F + U B B B (13) 
where U s /4 = U L /4= U F /2 = U B = .g 2 /(64/V) and 

£ b0 nd= V x.i(x)xi{x)Xj{y)Xj{y)- (14) 



i ,j, {xy) (Ebond 



Based on the above results, the partition function of the Z2 
model can be rewritten as 



Zz 2 = J HldXidXi] e" 



(15) 



where Sz 2 = <So + Sz 2 , int is the equivalent four-fermion 
action of the model. Here So = Y^ x ,y,i Xi( x ) D x,yXi(y) is the 
free fermion action. 

In the fermion bag approach, each four-fermion coupling is 
represented as a bond and expanded in powers of the cou- 
pling. For example the four-fermion coupling of the type 



Xi{Xp)Xi(xp)Xj(xq)Xj(xq) can be denoted by the bond vari- 
able bij(x p , x q ) — 0, 1, such that if it is then no bond is 
assumed to exist between the sites x p and x q , otherwise the 
specific four-fermion coupling is inserted in the partition func- 
tion. Due to the Grassmann nature of the couplings higher 
powers of the couplings do not exist. More details can be 
found in ifTTl . Thus, in the fermion bag formulation, the parti- 
tion function can be written as a sum over these bond config- 
urations [b], such that 

zz 2 = Y, u s s u n L L urur f Hm.dxi] e" s ° 

lb] 17 i 

x YiXt )Xi (%i 2 )-Xi {x lkt )Xi (xt kt ) 

i 

- E U s S u l l ur U n B B { II Ci ( Xil , .., x iki ) }(16) 

lb] 

where ns,riL,nF and n B are the total number of bonds of 
each type and the correlation function Ci(xi 1 , .., Xi k ) was de- 
fined in Eq.([9]l. A given bond configuration [b] uniquely de- 
termines the hi sites x\ x ....Xi h . (ordered in a consistent way). 
Since we argued above that C^x^, ..,Xi k .) > there is no 
sign problem in this expansion of the partition function for all 
non-negative values of Us,Ul,Uf, U b , any positive integer 
N and real mass m. 

In the case of the U(l) model, we need to integrate over 
both the auxiliary fields a(x),ir(x) on every dual site. It is 
straightforward to verify that 

x e" i2 T i (^'-[-.-] £ ( x )Xi(x)xi(x)) = j^f e -s I {x) 
where TV = (47rg 2 /7V) and 

2 



Si(x) 



6AN 



i,[x,x] 

~[ E £ ( x )Xt(x)Xi(3 



(18) 



Interestingly, the four-fermion couplings of the type S and F 
get canceled between the two terms in the above equation. On 
the other hand couplings of the type L and B survive so that 
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the four-fermion action for the U(l) model turns out to be 

Su(i) = 5*0 + U L B L + U B B B (19) 

with U L /4 = U B = g 2 /(l6N). Thus, the only difference 
between the Z 2 and £7(1) models is that the couplings Us — 
Uf = in the U(l) model. Indeed these couplings break the 
U(l) symmetry to a Z 2 symmetry as can be easily verified. 
Since we already proved that the sign problem in the Z 2 model 
was absent for all non-negative values of Us, Ul, Uf, Ub 
and N in the fermion bag formulation, the same is true for the 
17(1) model as well. 

V. CONCLUSIONS 

The fermion bag approach provides an alternative approach 
to fermion field theories where solutions to new sign prob- 
lems emerge naturally. Here we have demonstrated that some 
sign problems in the auxiliary field formulation of GN mod- 
els, especially with Z 2 and U(l) chiral symmetries, disappear 
in the fermion bag approach. While we have not shown here, 
we can solve sign problems in some lattice field theories con- 
taining both dynamical boson and fermion fields with similar 
chiral symmetries. In these more complex models, the so- 
lutions emerge when bosons are formulated in the world-line 



approach and the fermions are formulated in the bag approach. 
Such an approach to quantum field theories was proposed in 

GUI. 

Sign problems in other fermion models with more complex 
symmetries are also solvable in the fermion bag approach. 
However, in many interesting cases the Boltzmann weight of a 
fermion bag, although non-negative, turns out to be a fermio- 
nant instead of a determinant [31"]. Since the computation of 
the fermionant can be exponentially hard, the fermion bag ap- 
proach loses its practical appeal in such cases. Still, we be- 
lieve that there are many other interesting models where the 
weight of the fermion bag continues to be positive and com- 
putable with polynomial effort. 
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